A General Method for Statistical Performance Evaluation

نویسندگان

  • Longzhuang Li
  • Yi Shang
  • Wei Zhang
  • Hongchi Shi
چکیده

In the paper, we propose a general method for statistical performance evaluation. The method incorporates various statistical metrics and automatically selects an appropriate statistical metric according to the problem parameters. Empirically, We compare the performance of five representative statistical metrics under different conditions through simulation. They are expected loss, Friedman statistic, interval-based selection, probability of win, and probably approximately correct. In the experiments, expected loss is the best for small means, like 1 or 2, and probably approximately correct is the best for all the other cases. Also, we apply the general method to compare the performance of HITS-based algorithms that combine four relevance scoring methods, VSM, Okapi, TLS, and CDR, using a set of broad topic queries. Among the four relevance scoring methods, CDR is the best statistically when it is combined with a HITS-based algorithm.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance evaluation of the factors influencing Tourists General Satisfaction in the Border Cities. Baneh Border City

According to WTTC (World Tourism and Travel Council) forecast tourism contribution of global GDP will be about 6000 billion dollar in 2020 and will create 300 million job Therefore, Tourism could be considered as multidimensional field that response to tourists needs with diversity interests and motivations. Shopping is the most necessity needs and it is the one popular activity for tourists. B...

متن کامل

Intelligent Health Evaluation Method of Slewing Bearing Adopting Multiple Types of Signals from Monitoring System

Slewing bearing, which is widely applied in tank, excavator and wind turbine, is a critical component of rotational machine. Standard procedure for bearing life calculation and condition assessment was established in general rolling bearings, nevertheless, relatively less literatures, in regard to the health condition assessment of slewing bearing, were published in past. Real time health condi...

متن کامل

Evaluation of Unit's Performance in Presence of Subunits by Using GDEA

Data Envelopment Analysis (DEA) is a technique that uses all collected observations to measure performance. This method presents no data about how to operate on DMU. The present research attempted to study a unit with all its subunits, if the unit is efficient, it means that all its subunits are efficient too and if it is an inefficient, it shows clearly that which one of the subunits makes ine...

متن کامل

شناسایی شاخص‌های کلیدی سنجش عملکرد افراد برای پرداخت پاداش

The current performance evaluation process are inspired by performance management within the organizations to take a step further in a way to the factors such as competency, merit, capacity for improvement and promotion are supposed to be taken into account the in addition to performance evaluation itself. Nowadays, the organizations prefer to set more accurate criteria for performance evaluat...

متن کامل

Evaluation and Statistical Validation of Black-Spots Identification Methods

Despite the identification of crash hotspots as a first step of the roads safety management process, with various effective black spots identification (HSID) methods, only a few researchers have compared the performance of these methods; also it is not clear which test is the most consistent in the black-spots identification. In this research, seven commonly applied HSID methods (accident frequ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003